Goodness-of-fit and confidence intervals of approximate models
نویسندگان
چکیده
To test whether the model fits the data well, a goodness-of-fit (GOF) test can be used. The chi-square GOF test is often used to test the null hypothesis that a function describes the mean of the data well. The null hypothesis with this test is rejected too often, however, because the nominal significance level (usually 0.05) is exceeded. Alternatively, the level of Hotelling’s test is accurate if a fixed hypothesis for the mean is available. In many situations, however, only an estimate of the mean is available, and so the level of Hotelling’s test may also be incorrect. An approximate version of Hotelling’s test is suggested as a GOF test. It is shown that this requires only an adjustment of the degrees of freedom of Hotelling’s original test. GOF tests assume that the model is either correct or incorrect whereas in model specification it is often assumed that the model is an approximation. Consequently, for approximate models a GOF test will mostly indicate that the model does not fit. It is therefore suggested that a measure of approximation to the true model could be used to get an indication of how bad the approximate model is. It is also shown that correct confidence intervals can be obtained from when using an approximate model. The results are applied to data from the daily news memory test.
منابع مشابه
Exact Statistical Inference for Some Parametric Nonhomogeneous Poisson Processes
Nonhomogeneous Poisson processes (NHPPs) are often used to model recurrent events, and there is thus a need to check model fit for such models. We study the problem of obtaining exact goodness-of-fit tests for certain parametric NHPPs, using a method based on Monte Carlo simulation conditional on sufficient statistics. A closely related way of obtaining exact confidence intervals in parametri...
متن کاملMonte Carlo Comparison of Approximate Tolerance Intervals for the Poisson Distribution
The problem of finding tolerance intervals receives very much attention of researchers and are widely used in various statistical fields, including biometry, economics, reliability analysis and quality control. Tolerance interval is a random interval that covers a specified proportion of the population with a specified confidence level. In this paper, we compare approximate tolerance interva...
متن کاملAn Updated Review of Goodness of Fit Tests Based on Entropy
Different approaches to goodness of fit (GOF) testing are proposed. This survey intends to present the developments on Goodness of Fit based on entropy during the last 50 years, from the very first origins until the most recent advances for different data and models. Goodness of fit tests based on Shannon entropy was started by Vasicek in 1976 and were continued by many authors. In this paper, ...
متن کاملEstimation of reliability-based maintenance time intervals of Load-Haul-Dumper in an underground coal mine
Reliability estimation plays a significant role in the performance assessment of mining equipment, and aids in designing efficient and effective preventive maintenance strategies. Continuous and random/irregular occurrence of failures in a system could be the main cause for performance drop of machinery. The accomplishment of a projected level of production is possible only by an efficient oper...
متن کاملParameter Estimation in Astronomy with Poisson-distributed Data. Ii. the Modified Chi-square-gamma Statistic
I investigate the use of Pearson’s chi-square statistic, the Maximum Likelihood Ratio statistic for Poisson distributions, and the chi-square-gamma statistic (Mighell 1999, ApJ, 518, 380) for the determination of the goodness-of-fit between theoretical models and low-count Poisson-distributed data. I demonstrate that these statistics should not be used to determine the goodness-of-fit with data...
متن کامل